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25 Sir: 

Applicants hereby appeal the final rejection dated September 9, 2005, of 
claims 1 through 12 of the above-identified patent application. 

30 REAL PARTY IN INTEREST 

An assignment was filed on November 4, 2005, assigning the present 
application to International Business Machines Corporation. The assignee, International 
Business Machines Corporation, is the real party in interest. 

35 RELATED APPEALS AND INTERFERENCES 

There are no related appeals and interferences. 
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STATUS OF CLAIMS 
Claims 1 through 12 are pending in the above-identified patent 
application. Claims 1, 2, 7, 8, 10, and 11 remain rejected under 35 U.S.C. §102(b) as 
being anticipated by Braden-Harder et al. (United States Patent Number 5,933,822) and 
5 claims 3-6, 9, and 12 remain rejected under 35 U.S.C. § 103(a) as being unpatentable over 
Braden-Harder et al. in view of Diamond (United States Patent Number 6,269,368). 

STATUS OF AMENDMENTS 
There have been no amendments filed subsequent to the final rejection. 

10 

SUMMARY OF CLAIMED SUBJECT MATTER 
The present invention is directed to automated question answering that 
relates to the selection of an answer to a question from a pool of potential answers which 
are manually or automatically extracted from a large collection of textual documents 

15 (page 5, line 18, to page 7, line 17). The feature extraction component, a feature 
combination component, an answer selection component, and an answer presentation 
component, among others, are included (page 7, line 18, to page 9, line 6). The input to 
the system is a set of one or more natural language questions and a collection of textual 
documents (page 9, lines 15-22). The output is a (possibly ranked) set of factual answers 

20 to the questions, these answers being extracted from the document collection (page 9, line 
23, to page 10, line 5). 

STATEMENT OF GROUNDS OF REJECTION TO BE REVIEWED ON APPEAL 

Claims 1, 2, 7, 8, 10, and 1 1 are rejected under 35 U.S.C. §102(b) as being 
25 anticipated by Braden-Harder et al. and claims 3-6, 9, and 12 are rejected under 35 
U.S.C. §103(a) as being unpatentable over Braden-Harder et al. in view of Diamond. 
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ARGUMENT 
Independent Claims L 7 and 12 

Independent claims 1 and 7 were rejected under 35 U.S.C. § 102(b) as 
being anticipated by Braden-Harder et al. and claim 12 was rejected under 35 U.S.C. 
5 § 1 03(a) as being unpatentable over Braden-Harder et al. in view of Diamond. 

Regarding claims 1 and 7, the Examiner asserts that Braden-Harder 
discloses presenting the best scoring possible answer to the user with context from the 
passage containing the answer. Regarding claim 12, the Examiner asserts that Diamond 
teaches scoring each possible answer phrase, selecting one or more of the best scoring 

10 answer phrases, and displaying the answer phrases to the user. In the Response to 
Arguments section of the final Office Action, the Examiner asserts that the Applicants' 
argument that "Braden-Harder does not attempt to present answers with context from a 
relevant passage" is not claimed, and that the invention as claimed does not limit the 
"candidate list of passages of possible answers" to a phrase, a sentence, a paragraph, or a 

15 document. The Examiner further asserts that a "pool of potential answers... extracted 
from a large collection of textual documents" cannot be limited to a "snippet." 

Applicants note that the present invention is directed to selecting answers 
to natural language questions "from a pool of potential answers which are manually or 
automatically extracted from a large collection of textual documents." (Page 1, lines 9- 

20 11.) An "answer" is defined as "a statement (either spoken or written) that is made in 
reply to a question or request or criticism or accusation." (See, dictionary.com) Thus, 
answers are not documents but are short snippets or "snippets of text (e.g., phrases) 
which provide the exact answer to the question' 7 (See, definition on page 1, lines 21-23, 
of the present disclosure.) Although Braden-Hader utilizes natural language processing 

25 to identify documents that are relevant to a query, Braden-Harder discloses that the 
processor "presents the retained documents to the user rank-ordered based on their 
score." (Col. 8, lines 2-4; emphasis added.) As with most conventional search engines, 
Braden-Harder simply returns documents. Braden-Harder does not attempt to present 
answers, as defined in the present invention and as would be understood by a person of 

30 ordinary skill in the art, and does not attempt to present answers with context from a 
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relevant passage. Independent claim 1 requires "presenting the best scoring possible 
answer to the user with context from the passage containing the answer." Thus, contrary 
to the Examiner's assertion, presenting answers with context from a relevant passage is 
claimed in the present disclosure. In addition, independent claim 7 requires an answer 
5 selection module and an answer presentation module, and independent claim 12 requires 
scoring t( each possible answer phrase, selecting one or more of the best scoring answer 
phrases, and displaying the answer phrases to the user." Braden-Harder does not 
disclose or suggest any of these limitations. 

Applicants also note that Diamond is directed to a system and method for 

10 improving information retrieval effectiveness by dynamically combining evidence 
information produced by a plurality of retrieval systems matching alternative 
representations of queries and documents. (See, Abstract.) As with most conventional 
search engines, Diamond simply returns documents. Diamond does not attempt to 
present answers, as defined in the present invention and as would be understood by a 

15 person of ordinary skill in the art, and does not attempt to present answers with context 
from a relevant passage. ^ 

Thus, Braden-Harder and Diamond, alone or in combination, do not 
disclose or suggest presenting the best scoring possible answer to the user with context 
from the passage containing the answer, as required by claim 1, do not disclose or 

20 suggest an answer selection module and an answer presentation module, as required by 
independent claim 7, and do not disclose or suggest scoring each possible answer phrase, 
selecting one or more of the best scoring answer phrases, and displaying the answer 
phrases to the user, as required by independent claim 12. 
Claims 5 and 9 

25 Claims 5 and 9 are rejected under 35 U.S.C. § 103(a) as being unpatentable 

over Braden-Harder et al. in view of Diamond. In particular, the Examiner asserts that 
Braden-Harder discloses that "the parameters of the scoring function are learned by a 
machine learning algorithm." 

Applicants note that the text cited by the Examiner is directed to "the 

30 manner through which our invention (Braden-Harder) compares and weights matching 
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logical form triples, and ranks corresponding documents." (Col. 17, lines 5-7.) 
Applicants could find no disclosure or suggestion by Braden-Harder, however, that "the 
parameters of the scoring function are learned by a machine learning algorithm." 

Thus, Braden-Harder and Diamond, alone or in combination, do not 
5 disclose or suggest the parameters of the scoring function are learned by a machine 
learning algorithm, as required by claims 5 and 9. 



Conclusion 

The rejections of the cited claims under sections 102 and 103 in view of 
10 Braden-Harder and Diamond, alone or in combination, are therefore believed to be 
improper and should be withdrawn. The remaining rejected dependent claims are 
believed allowable for at least the reasons identified above with respect to the 
independent claims. 

The attention of the Examiner and the Appeal Board to this matter is 

1 5 appreciated. 

Respectfully, 



20 Date: December 2, 2005 Kevin M. Mason 

Attorney for Applicant(s) 
Reg. No. 36,597 
Ryan, Mason & Lewis, LLP 
1300 Post Road, Suite 205 

25 Fairfield, CT 06824 

(203) 255-6560 
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APPENDIX 

1. A method for selecting answers to natural language questions from a 
collection of textual documents comprising the steps of: 

5 extracting scoring features from a candidate list of passages of possible 

answers; 

scoring the possible answers using the extracted features and a feature 
scoring function; and 

presenting the best scoring possible answer to the user with context from 
10 the passage containing the answer. 

2. A method as in claim 1, wherein the features used to score possible 
answers consists of one or more of the following features: a semantic type of a current 
suspected answer, a position of the suspected answer among all suspected answers within 

15 all document passages, a position of the suspected answer among all suspected answers 
within the given passage, a number of suspected answers of a given semantic type 
retrieved within a given passage, a number of words in a suspected answer that do not 
appear in the user question, a position of the semantic type in the list of potential 
semantic types for the question, an average distance in words between the beginning of 

20 the potential answer and the words in the question that also appear in the passage, a 
passage relevance as computed by the information retrieval engine, a frequency of a 
given potential answer on the list, a semantic relation between words from the question 
and words from the potential answer, and a strength score that is a function of the 
relevance score. 

25 

3. A method as in claim 2, wherein the feature scoring function is a linear 
combination of weighted features. 

4. A method as in claim 3, wherein the parameters of the scoring function 
30 are manually determined. 
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5. A method as in claim 3, wherein the parameters of the scoring function 
are learned by a machine learning algorithm. 

6. A method as in claim 1 where the candidate list of passages of possible 
5 answers is obtained from the collection of documents using an information retrieval 

engine 

7. A computer system that extracts answers to natural language questions 
from a large collection of textual documents consisting of one or more of the following 

10 modules: 

a feature extraction module; 

a feature combination module, containing a "feature extraction" and 
"compute composite score" components; 

an answer selection module; and 
15 an answer presentation module. 

8. A computer system, as in claim 7, wherein the feature extraction 
module extracts one or more of the following features: a semantic type of the current 
suspected answer, a position of the suspected answer among all suspected answers within 

20 all document passages, a position of the suspected answer among all suspected answers 
within the given passage, a number of suspected answers of a given semantic type 
retrieved within a given passage, a number of words in an suspected answer that do not 
appear in the user question, a position of the semantic type in the list of potential 
semantic types for the question, an average distance in words between the beginning of 

25 the potential answer and the words in the question that also appear in the passage, a 
passage relevance as computed the retrieval engine, a frequency of a given potential 
answer on the list, a semantic relation between words from the question and words from 
the potential answer, and a strength score that is a function of the relevance score. 
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9. A computer system as in claim 7, wherein the feature combination 
module employs a feature scoring function with parameters learned by a machine 
learning method. 

5 10. A computer system as in claim 7, wherein the answer selection module 

selects the answer with the best score obtained from the feature combination module. 

11. A computer system as in claim 7, wherein the answer presentation 
module shows the top scored answer within the context as specified by a user or a 

10 system. 

12. A computer program product that performs the steps of: 
determining a feature scoring function during a training phase either 

manually or via a machine learning algorithm applied to a set of training questions, 
15 corresponding answer passages, and certain extracted features; and 

during an execution phase, extracting certain features from questions and 
corresponding possible answer phrases, applying the feature scoring function determined 
during the training phase to score each possible answers phrase, selecting one or more of 
the best scoring answer phrases, and displaying the answer phrases to the user with 
20 optional additional context from the answer passages. 
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EVIDENCE APPENDIX 
There is no evidence submitted pursuant to § 1.130, 1.131, or 1.132 or 
entered by the Examiner and relied upon by appellant. 
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RELATED PROCEEDINGS APPENDIX 
There are no known decisions rendered by a court or the Board in any 
proceeding identified pursuant to paragraph (c)(l)(ii) of 37 CFR 41.37. 
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